Conformational properties of compact polymers* 
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Monte Carlo simulations of coarse-grained polymers provide a useful tool to deepen the under- 
standing of conformational and statistical properties of polymers both in physical as well as in 
biological systems. In this study we sample compact conformations on a cubic L x L x L lattice 
with different occupancy fractions by modifying a recently proposed algorithm. The system sizes 
studied extend up to A*' = 256 000 monomers, going well beyond the limits of older publications 
on compact polymers. We analyze several conformational properties of these polymers, including 
segment correlations and screening of excluded volume. Most importantly we propose a scaling law 
for the end-to-end distance distribution and analyze the moments of this distribution. It shows 
universality with respect to different occupancy fractions, i.e. system densities. We further analyze 
the distance distribution between intrachain segments, which turns out to be of great importance 
for biological experiments. We apply these new findings to the problem of chromatin folding inside 
interphase nuclei and show that - although chromatin is in a compacted state - the classical theory 
of compact polymers does not explain recent experimental results. 
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INTRODUCTION 



Compact polymers are of sienificant interest from the 
viewpoint of polymer physics [l|] . They describe the con- 
formational behavior of polymers obeying excluded vol- 
ume interactions which are brought into a bad solvent, 
meaning that the solvent molecules effectively tend to 
repell the chain molecules. This leads to the formation 
of compact arrangements of the monomers as the poly- 
mer tries to minimize the solvent chain interface. From 
a different viewpoint compact conformations also arise 
when there are attractive forces acting between the con- 
stituents of the polymer which are larger than the en- 
tropic forces trying to enhance the radius of gyration of 
the polymer. A polymer is said to be compact or globu- 
lar if its characteristic size scales with iV^/'^, resulting in 
a nearly uniform monomer concentration inside a globule 
of a certain radius scaling with N^^"^. However, besides 
the pure physical interest in compact polymers, it is also 
of great importance to study such conformations from 
the biological point of view. Many polymers in living 
organisms tend to organize in a compact way, the prime 
example are proteins 0. A lot of studies have been de- 
voted to the problem of protein folding startingfrom the 
sampling of random compact conformations [1,11] . There 
is also evidence that chromatin inside the interphase nu- 
cleus is organized at least partly in a very compacted 
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State [i,ll,[3l- 

Here we want to deepen the understanding of the 
class of compact polymers with respect to its confor- 
mational and statistical properties using a Monte Carlo 
method. Monte Carlo simulations have proven to be a 
very important technique to study ensembles of poly- 
mer chains. Much progress has been achieved for exam- 
ple by usin g sim ple lattice models to study self-avoiding 
walks [1, [qI. Tiol. [HI . Obviously, Monte Carlo techniques 
become unavoidable when analytical or exact enumera- 
tion methods fail to be feasible. Indeed, analytical meth- 
ods become very rare when studying problems where ex- 
cluded volume has to be taken into account, which is 
the case in most applications. Exact enumeration meth- 
ods 13, ill , where the whole configuration space is sam- 
pled and exact averages can be calculated, are limited to 
very small system sizes which most often do not show the 
correct asymptotic behavior for large chains. 

In the last few years several Monte Carlo algorithms 
have been proposed to study com pac t polymers based on 
the idea of Hamiltonian paths [l4l,Tl5l, lig| . A Hamiltonian 
path on some graph G with set of vertices V and edges 
8 is defined as a path which visits each vertex V & V 
exactly once. Obviously, Hamiltonian paths studied on a 
cubic lattice are prime examples of maximally compact 
polymers where the number of nearest neighbor contacts 
is maximized. However, the exact enumeration of all pos- 
sible conformations is not feasible as the computer re- 
sources needed grow exponentially with growing system 
size [H]. Thus such studies are limited to rather small 
system sizes which probably do not reflect the properties 
of compact polymers in the limit of large chain lengths 
N . Therefore, several Monte Carlo techniques have been 
developed to sample a representative ensemble of Hamil- 
tonian paths on a cubic lattice. One important aspect 
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of such an algorithm is that sampHng of conformations 
is done in an unbiased way. Two algorithms have been 



shown to fail this test [U, 116[ . Recently, Mansfield pro- 
posed an algorithm which is shown to produce unbiased 
samples to a high degree of certainty [15[ . 

The scope of this study is three-fold. First of all, we 
extend the algorithm proposed by Mansfield [3l in or- 
der to study not only maximally compact conformations 
where all lattice sites on a cubic lattice are occupied, 
but also conformations in less dense systems with den- 
sity p 1. By using a highly parallel system we generate 
chains of lengths much larger than the ones studied in 
previous publications. The largest system size for p = I 
is L = 55, the largest chain length studied is = 256 000 
for a density of p = 0.5. Secondly, we analyze several sta- 
tistical and conformational properties of these compact 
conformations. Special interest is on the distance dis- 
tribution between the end points of the chain as well as 
the distance distribution between smaller segments of the 
chain. Stunningly this quantity has not been analyzed 
in previous publications, although it is of severe impor- 
tance for biological applications as the distributions can 
be compared directly to experimental data (3]. Further- 
more, we provide a comparison of compact polymers to a 
polymer melt of equal density, which has been suggested 
to behave similarly P, |T^. In a third step, we apply the 
results of our simulational study to recent experimen- 
tal data concerning the folding of chromatin inside the 
interphase nucleus. We show that chromatin on a scale 
above 150 kb does not organize simply in a compact state 
as the behaviour of the mean square displacement with 
genomic distance might suggest [7[ , but shows important 
hallmarks of a disordered system as described by the ran- 
dom loop model 0, ■ 



THE COMPUTATIONAL METHOD 
Algorithm 

To create conformations of compact polymers on a cu- 
bic lattice we use a modified version of the algorithm 
Mansfield [l3| proposed for sampling Hamiltonian paths. 
This algorithm is a Metropolis Monte Carlo technique 
and is one of the few known algorithms except exact 
enumeration methods which is unbiased, i.e. every al- 
lowed conformation is sampled with equal probability. 
While other proposed algorithms for sampling Hamil- 
tonian paths have been shown to produce biased re- 
sults [13, m, Mansfield proved in his paper [15| ergodic- 

ity for small lattices by exact enumeration and devised a 
method providing strong evidence that the algorithm is 
ergodic for even larger lattices. 

Here we are not only interested in Hamiltonian paths, 
which only make up a subset of compact polymers but 
also in compact conformations within the finite tempera- 



ture regime where p ^ 1. Therefore we have to allow va- 
cancies on the simulation lattice. This is done by modify- 
ing the algorithm such that it also handles lattices where 
not all vertices are occupied by introducing a reptation 
step for the chain ends. 

Consider a cubic lattice of dimension L x L x L, each 
lattice side r = {x, y, z) is either occupied or unoccupied. 
The chain itself is stored in a list of lattice sites. The 
algorithm then works as follows: 

1. Randomly select one of the two ends of the chain, 
the coordinates denoted by rg. 

2. Randomly select one of the six neighbouring sites 
of te on the cubic lattice, denoted by tn. 

3. Test if rjsf lies outside the lattice. If so, proceed 
with (5), otherwise proceed with (4). 

4. Test if the lattice site tn is occupied. If the lattice 
site is occupied and lies at the head of the list, 
we reverse the part of the list lying above r jv . If te 
lies at the tail of the list, we reverse the part of the 
list lying below tn. If the lattice site is unoccupied 
we do a reptation move, i.e. we append the position 
tn to the head (if te is currently head) or to the 
tail (if te is currently tail) of the list and remove 
the other end of the list. 

5. Take the new conformation (if or if not it has 
changed) as the current configuration. 

This algorithm equals the one proposed by Mansfield for 
compact polymers with p = 1, as in this case reptation 
steps are not possible. The only change applied to the 
algorithm is that we allow for a reptation step in (4) 
whenever the lattice site r^v is unoccupied. 



Ergodicity and unbiased sampling 

The algorithm is a Monte Carlo technique, which 
means that we have to make sure that it produces un- 
biased samples. Unbiased means that a) the algorithm 
samples the complete configuration space and that b) 
each configuration is sampled with equal probability. 
Unbiased sampling is ensured if the algorithm satis- 
fies detailed balance and ergodicity. Detailed balance 
is satisfied obviously by the algorithm. However, it is 
not clear a priori that ergodicity is satisfied. There is 
strong evidence that the algorithm satisfies ergodicity for 
p = 1 [l3| , but we have to make sure that this also holds 
for our modified algorithm. The reptation algorithm on 
its own is non-ergodic [isj as a conformation can get 
trapped in a state where no more reptation moves are 
possible. However, this problem is resolved here: When- 
ever there is no free adjacent lattice site to a chain end, 
the algorithm performs a half-list reversal such that the 
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chain's end moves to another lattice site. To test the 
ergodicity of the algorithm quantitatively, we performed 
an ergodicity test on a small 3x3x3 lattice in the fol- 
lowing manner: First we enumerate all possible walks of 
given length N on the lattice by exact enumeration. Each 
of these conformations are assigned an equivalence class 
Ai. An equivalence class Ai (i = 1, . . . , M) represents all 
walks which are equal after applying translational shift 
(for p 7^ 1), one of the 47 symmetry operations on the cu- 
bic lattice or a path reversal. It is obvious to subsume all 
conformations with rotational and translational symme- 
try into one equivalence class, as we are not interested in 
properties depending on the rotational state or absolute 
position in space. To include path reversal is necessary 
for the algorithm to be ergodic for p = 1 15| . For exam- 
ple there are 51 704 equivalence classes for N = 27 and 
2 750 classes for = 10 on a 3 x 3 x 3 lattice. 

Let Pi be the probability for a conformation to fall in 
equivalence class Ai, determined by exact enumeration. 
If we then produce K independent samples by the Monte 
Carlo algorithm, the probability that k samples fall in 
equivalence class Ai then is 



\K-k 



Let qi denote the relative abundance that from our K 
samples a randomly chosen one falls into equivalence class 
Ai. If the algorithm is unbiased, then the random vari- 
able 



Pi 



Pi(i-Pi) 

K 



(1) 



is normal distributed with mean zero and variance equal 
to unity. Fig. [T] shows that the ti are in a very good ap- 
proximation normal distributed for different chain length 
N . Deviations from the normal distribution most prob- 
ably are due to the fact that the e,; are not independent. 
As there is no reason to believe that ergodicity is bro- 
ken for larger lattice sizes, our modified algorithm most 
probably satisfies ergodicity. 



Autocorrelation times 

As the algorithm only changes one bond per Monte 
Carlo step, subsequent conformations arc highly corre- 
lated. A lot of subsequent Monte Carlo steps have to be 
performed until conformations get uncorrelated. Follow- 
ing Mansfield [1^1 , we calculate the autocorrelation func- 
tion Cat^ (i) of the observable N^, which is defined as the 
number of bonds oriented along the x-direction. We then 
determine the exponential decay time of the correlations 
by a fit to the function f{x) ~ exp(— t/re^jp) and obtain 
the exponential autocorrelation time T^xpip, N). This is 



0.45 

0.4 
0.35 

0.3 
0.25 

0.2 
0.15 

0.1 
0.05 




N=27 

» N = 15 

<f N=10 

? % standard normal 




\ 



-5 -4 -3 -2 -1 1 2 3 4 5 

FIG. 1: Ergodicity test of the algorithm for polymers on a 
3 x 3 X 3-lattice. Shown is the probability distribution of e^, 
given in eq. ([T}. The distribution p(ei) is in good agreement 
with the standard normal distribution, which is evidence that 
the algorithm is ergodic. 



TABLE I: Fitting parameters for autocorrelation times. The 
exponential and integrated autocorrelation time was deter- 
mined for different system sizes L. Then for each density used 
the obtained autocorrelation times were fitted to the function 
r = cA"*. 



p 


d(^Texp ) 




d{Tint) 


c(ri„t) 


0.1 


1.07(1) 


5.07(36) 


1.05(1) 


6.2(5) 


0.5 


0.995(3) 


2.50(6) 


0.97(2) 


3.1(4) 


1 


0.954(3) 


2.32(5) 


0.926(4) 


2.94(10) 



the time scale defining how long we have to wait ini- 
tially before sampling any conformations. On the other 
hand, the integrated autocorrelation time Tint tells us 
how many Monte Carlo steps have to be carried out until 
we have a subsequent independent conformation. From 
FigElonc can see that Tint{p, N) as a function of A^ has a 
power-law behavior. We fit the correlation times to the 
function 

t{p,N) = cip)N'^^P^ 

and obtain the results shown in table HI Two conforma- 
tions can be considered uncorrelated after 2Tint Monte 
Carlo steps [3]. Based on these results we write out a 
conformation after 14A^ steps for p = OA and 6A^ steps 
for p > 0.1. 



RESULTS 

In this section we present results on the conformational 
properties of compact polymers using the algorithm de- 
scribed above. We performed extensive simulations for 
three different densities p = 0.1,0.5 and 1.0. For each 
density a broad range of system sizes has been studied. 
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FIG. 2: Integrated autocorrelation times Tint of compact poly- 
mers using the described algorithm. Integrated autocorrela- 
tion times show a power-law behavior yielding a linear depen- 
dence on A^. 



For p = 0.1 we did simulations up to L = 130, for p = 0.5 
the largest system size studied is L = 80 and for p = 1.0 
we were limited by computing time to i = 55. Thus 
the largest simulated chains arc made up of = 166 375 
monomers for p = 0.1, N = 256 000 for p = 0.5 and 
= 219 700 for p = 0.1. For each density and system 
size we sampled between 20 000 up to one million confor- 
mations, depending on the system size. 



End-to-end distance statistics 

One characteristic length scale of a polymer is given by 
the mean squared end-to-end distance, i.e. the distance 
between the two endpoints of the chain, which is often 
denoted by Obviously, (-Re) depends on the total 

length of the chain, which is denoted by the number of 
monomers N and in our case related to the system size 
L. The relation between the mean squared end-to-end 
distance and the number of monomers can be written in 
terms of a scaling law for polymers in good solvents. 



{Rl) - iV^ 



(2) 



For random walks, i.e. polymers where excluded volume 
effects are ignored, it can be shown straightforwardly that 
the scaling exponent is z/ = 0.5 [H. For self-avoiding 
walks in good solvents, where excluded volume effects 
are taken into account, the polymer is more swollen com- 
pared to the random walk. The resulting exponent is not 
known exactly, but estimated by field theoretical meth- 
ods to z/ « 0.588 [l^. A compact polymer on the other 
hand is characterized by an exponent of = 1/3 repre- 
senting a globular shape with homogeneous density. The 
scaling law {R^) = bN^^^ is also valid for the compact 
conformations studied here. Table [TTl shows values for the 



parameter b determined by a fit to the data for different 
densities p. 

While a scaling law for the end-to-end distance dis- 
tribution P{r) for self-avoiding walks has been proposed 
long ago by Fisher (20j , it remains unclear whether there 
is kind of universal scaling law for the end-to-end distri- 
bution for compact polymers as well. Moreover it is not 
known what is the functional form of this scaling func- 
tion. As L is the only length scale in our system, which is 
related to i? = ^/{R^ = bN^/^ = bp^^^L, there is a good 
chance that the distributions scale with r/R. This leads 
us to propose a scaling law for the distributions similar 
to that of a random walk or self-avoiding walk 



P(r) 



A 
R \R 



exp 



(3) 



For the random walk as well as the self-avoiding walk, the 
exponents p and S are well-known (see for example the 
book 0). We now want to determine these exponents 
for the compact polymers studied here. 

We can determine the parameters A and B by the fol- 
lowing normalization conditions 

POO POO 

/ P{r)dr = 1 / r^P{r)dr = R^ 
Jo Jo 



and obtain 



B = 



1 ^ 1 



Here r(-) denotes the Gamma function, which interpo- 
lates the factorial function. 

In Fig. [3] it is shown that the scahng with R holds very 
well for different densities p. The analytic form of P{r) 
approximates the data fairly well although deviations 
from the data are larger than for a self-avoiding walk, 
as the normalization condition above neglects the fact 
that end-to-end distances cannot extend beyond ^/2L. 

We fit the theoretical distribution function P{r) to the 
data for three different p-values and obtain: 



(4) 



Within the fitting error the exponents p and S are the 
same for different densities p, suggesting that these values 
show some universal features of compact polymers. On 
average we obtain from the above data 
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1.89(3) 
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= 2.94(4) 
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1.87(4) 
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= 2.91(6) 


p = 


1.0 : 


A* = 


1.90(3) 
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= 2.94(5) 



p = 1.889(65) 5 = 2.932(89) 



(5) 



Comparing experimental data to the distribution func- 
tion P{r) is not always the method of choice, especially 
when the number of data points is too small for creating 
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distance r/R distance r/R distance r/R 

FIG. 3: Probability distribution of the end-to-end distances for compact polymers. The distributions are scaled with root mean 
squared end-to-end distance R — ^ {R^} = aN^^^ where the parameter a is determined by a fit according to table [III For each 
density systems of different size fall on top of each other. For p = 0.1 system sizes vary from L = 10 to L = 130, for p — 0.5 
we analyzed systems from L = 10 to L = 80 and for p = 1 the lattices used range from L = 10 to L = 55. The black lines 
represent a two-parameter fit to the empirical distribution function ([3|, the fitting parameters are listed in (j4|. For comparison, 
the dotted line shows the distribution of two random points on a cubic lattice obeying excluded volume. 



TABLE II: The effective monomer size determined by a fit to 
the function (RI) = fe^iV^/^ = a^L^ for each density p. Ob- 
viously, the effective monomer size decreases with increasing 
density. 



lytic function P{r) given in eq. ([3]). Calculations yield 



p 


b 


a 


0.1 


1.50(2) 


0.696(7) 


0.5 


0.89(7) 


0.712(5) 


1 


0.71(1) 


0.71(1) 



reasonable histograms. In this case it is more useful to 
look at the first few moments of the distribution. Here 
we analyze dimensionless ratios of moments of the end- 
to-end distance distribution, having the advantage that 
no adjustable parameter is present. The ratios of interest 
here are 



C2 



C3 



C4 = 



For a random walk and a self-avoiding walk, the a are 
constants not depending on any model parameters (such 
as linker length I). Here we show that this is also the case 
for compact polymers and we determine its values. The 
ratio plots arc shown in Fig. [4l A fit yields the values 

C2 = 1.1395(5) C3 = 1.421(2) a = 1.458(1) 

Obviously, there are significant differences from a RW, 
however differences from a SAW behavior become only 
visible in the fourth order ratio C4. Wc demonstrate in a 
later section how this information can be used to charac- 
terize the behavior of biopolymers. 

Instead of determining the moments by the raw simu- 
lational data we can determine the moments by the ana- 
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(6) 
(7) 
(8) 



Plugging in the fit parameters for /i and S from eq. ^ 
we obtain 

C2 = 1.140(6) C3 = 1.43(2) C4 = 1.488(24) 

The values are compatible with the ones calculated di- 
rectly by a fit to the simulation data within the range of 
the errors. Deviations become large for larger moments 
reflecting the approximative character of the scaling func- 
tion P{r). 



Intrachain distance statistics 



In the past a lot of effort has been undertaken to s tudy 
the end-to-end distributions of self-avoiding walks [20|. 
For random walks, this problem is easily solved analyt- 
ically. In the last section we studied these distributions 
for compact polymers which has not been done so far. We 
found a similar scaling function as for random and self- 
avoiding walks with exponents which seem to be univer- 
sal for compact polymers of different densities and chain 
lengths. However, from the experimental point of view 
one is not only interested in the distance between end 
points of a compact polymer but also in the distance 
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FIG. 4: Ratios of the moments of the end-to-end distances. 
The ratios are independent of the system size L (except finite 
size effects) and seem to be independent of the density p. 



FIG. 5: This figure shows the ratios of the moments of the 
distribution of distances between two parts of the chain which 
are separated by a contour length n. 



distribution between two arbitrary monomers along the 
chain which are separated by a certain contour length n. 
For example, this quantity becomes important in experi- 
ments measuring the spatial arrangement of two floures- 
cently labelled parts of the human genome (3]. Therefore, 
we have to evaluate how the distribution changes when 
looking at intrachain segments. 

It is most interesting to look at the moment ratios as 
these ratios are easiest to compare to experimental data, 
which quite often do not provide enough data points to 
obtain a complete distribution function. For the largest 
chains simulated the moment ratios are shown in Fig. [5] 
for various contour length n. The mean values are av- 
erages both over different positions along one chain as 
well as over the set of sampled conformations C. The kth 
moment is thus evaluated as 

W = ^]^EE iira„-rf r- (9) 

' ' cec 1=1 

Here rf denotes the position of the ith monomer of the 
conformation C out of the set of sampled conformations 
C. 

For small contour length n the moment ratios are 



peaked and reach the value of a random-walk. This is 
due to the screening effect in compact polymers, which is 
in detail analyzed in one of the following sections. How- 
ever, the ratios pretty fast fall below the ratios for self- 
avoiding walks and stay mostly constant. There is only 
a small increase for the ratios where the contour length 
approaches the chain length N, indicating that the chain 
ends have more freedom for fluctuations than parts em- 
bedded in the middle of the chain. We want to stress here 
that for sufficiently large contour lengths the moment ra- 
tios for intrachain distances are smaller than those for the 
end-to-end distances. 

End-point statistics 

It is an open question, raised for example in Ref. 
whether the positions of the end points of a compact 
polymer are correlated or not. It was suspected that an 
entropic cost associated with local rearrangement around 
the chain's ends might cause some effective attraction 
or repulsion between them. Let ri = {xi,yi, zi) and 
r2 = ix2, Z2) denote the end point vectors with respect 
to the center of the simulation box. Lua et al. Ildl showed 
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then for Hamiltonian paths up to L 
point correlation coefficient 

{X1X2) 



fO that the end 



(10) 



is negative for smah lattice sizes but pretty fast ap- 
proaches the correlation between disconnected points 
only obeying excluded volume and the chess board the- 
orem [f6[. The latter theorem states that, if we mark 
adjacent vertices on the lattice graph with different col- 
ors similar to a chess board, then the end points of a 
chain with even numbers of monomers are sitting on lat- 
tice sites with different color while the end points of a 
chain with an odd number of monomers are positioned 
on lattice sites of same color. This restriction has to be 
taken into account when comparing to randomly posi- 
tioned points, as this is an inherent feature of the lattice 
model but not of the ensemble of compact polymers in 
general. However it becomes more and more negligible 
the larger the lattice size. 

Here we study the correlation coefficient c for lattice 
sizes much larger than in [3| ■ Note that the coordinates 
in eq. ([TU]) are taken with respect to the center of the 
simulation cube. Fig. [S] shows that there are negative 
correlations for all densities considered, which approach 
zero for larger lattice sizes. The only deviations are for 
p = 1 (i.e. Hamiltonian paths) and even lattice sizes, 
which obviously is an effect of the lattice geometry and 
the chess board theorem and therefore no intrinsic prop- 
erty of compact polymers. These results are in very good 
agreement with the results by Lua et al. suggesting 
that there arc no end-point correlations in the N ^ 00 
limit. 

We also analyzed the mean square displacement of the 
endpoints from the center of the cube in order to answer 
the question whether the polymer tends to arrange such 
that the endpoints predominantly locate in the center of 
the cube or at its periphery. Being located in the center 
of the cube might be disadvantageous due to entropic 
reasons. Fig. [7] shows that for p = 0.5 and p = 1 the 
points are predominantly shifted towards the periphery 
of the cube, while for p ~ 0.1 the points are located more 
in the interior of the cube. 



Correlations of intrachain segments 

Consider two arbitrary points on a polymer with coor- 
dinates ri = {xi,yi,Zi) and r2 = {x2,y2, ^2)- As for the 
end points of the chain, we can pose the question whether 
the coordinates of these points are correlated by evalu- 
ating the correlation coefficient of eq. (fTO|) We assume 
the coordinates ri and r2 to be given with respect to the 
center of mass of the polymer. A value of c = indicates 
that there is no correlation between the coordinates ri 
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FIG. 6: End-end correlation coefficients according to eq. (|10p 
for compact polymers with different densities (p — 0.1, p = 
0.5 and p — 1). Additionally shown are exact results for the 
correlation coefficient of two randomly positioned points on 
a cubic lattice only obeying excluded volume and the chess 
board theorem. 
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FIG. 7: Distance of the end points from the center of the cube 
for compact polymers with different densities. For comparison 
shown are also the results for random points obeying excluded 
volume and the chessboard theorem. 



and r2, i.e. they effectively behave like two randomly cho- 
sen points on the cubic lattice. A value c ^ indicates 
an effective attraction or repulsion. Fig. [5] shows the cor- 
relation coefficient in dependence of the contour length 
n between the segments. While for short contour length 
there are high correlations because of the connectivity 
of the chain, these correlations decay fast and for larger 
contour length correlations are nearly vanished. This is 
in stark contrast to the behavior of a self-avoiding walk, 
where (negative) correlation effects are dominant even 
on the length of the whole chain. This result is in per- 
fect agreement with the scaling theory developed by De 
Gennes [2l|, which predicts that on the length scale of 
the compact system, parts of the chain become practi- 
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0.3 0.4 0.5 0.6 0.7 
relative contour length n/N 

FIG. 8: Position correlations of monomers. This figure shows 
the correlation coefficient .l^'-f^^ , between two monomers 

in dependence of their separation along the chain. The data 
shown are for system sizes L = 55(p = = 70(p = 0.5) 
and L = 130(p = 0.1). For comparison the correlations in a 
self-avoiding walk chain of length A'' = 10 000 are also shown. 



cally independent. If the scaling theory is correct then 
the decay length of the correlations which we define 
as the length where c{nd) ~ 1/e, should be related to 
the system size L. We test this prediction by evaluat- 
ing the ratio r = (^R^{nd)) / L'^ where (^R^{nd)) is the 
mean square displacement between the end points of a 
segment of length Ud- For the largest system sizes stud- 
ied we find the values r w 0.31 for p = 1, r « 0.31 for 
p = 0.5, r K, 0.29 for p = 0.1. As r-values are nearly 
equal for systems of completely different size and density 
there is strong evidence that the decay of the position 
correlations is directly related to the system size. 



Screening of excluded volume in compact polymers 

Much is known about polymer melts, where a number 
of polymers with degree of polymerization N is placed 
in a system with volume fraction $. Below a critical 
concentration (or volume fraction) the polymers do 
not feel the existence of the others and basically behave 
like self-avoiding walks in a good solvent. This critical 
concentration $* is given by the volume fraction where 
the free coils with extension (-R^)q ~ N'^'^ begin to over- 
lap [ilj. The index indicates here the limit $ ^ 0. The 
value of the critical concentration scales like 



N 



At volume fractions above <&* the polymers begin to feel 
each other and the system can be described by the cor- 
relation length ^. On scales larger than this correlation 
length, the chains effectively behave like ideal coils, a 




5 10 
contour length n 



FIG. 9: Mean squared distances (ii^) between monomers 
separated by a contour length of n for different densities. The 
compact polymers also show a screening of excluded volume 
for large densities. 



theorem most often referred to as Flory theorem. On a 
scale smaller than ^ excluded volume effects st ill p lay a 
dominant role. By scaling arguments one finds [21| that 



Here we want to investigate the question whether there 
is a fundamental difference between a polymer melt with 
volume fraction $ and a compact polymer, i.e. a melt 
with degree of polymerization N = ^L^. Consider an 
arbitrary segment of a compact polymer of length Nm- 
We impose on Nm the condition that the extent of this 
segment must be smaller than the system size in order 
to avoid effects of the confinement to play a role. We 
now ask whether these segments of length Nm of our 
compact polymers behave different from a polymer with 
degree of polymerization N„i in a corresponding melt. 
The analysis of positional and angular correlation effects 
which are decaying pretty fast suggests that a part of the 
chain should not "feel" that it is connected with a part 
far away. 

The theory of polymer solutions predicts that there 
is a crossover from a self-avoiding walk behavior to a 

Fig. M 



22| 



random walk behavior on the short scale 
shows that this crossover becomes indeed apparent in 
the mean square displacement for short contour lengths 
in compact polymers. While for p = 0.1 we find a self- 
avoiding walk type of scaling with exponent v ~ 0.588 for 
contour lengths up to n « 50, the maximal dense system 
behaves more like a random walk with v = 0.5 indicating 
the screening of excluded volume in this system. 

To analyze the screening length we have to look in more 
detail at the structure function of parts of the chains. The 
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structure function is defined as 



J=0 



(11) 



The brackets denote a spherical average over all q- vectors 
of equal magnitude and over all conformations. The sum 
is over a subchain of length Nm whose position vectors 
are denoted by ro . . . rjv^. One expects these subchains 
to behave like random walks on distances larger than ^ 
and self-avoiding walks on distances smaller than ^, i.e. 



S{q) 



>l 



(12) 



In Fig.[Tn]the structure function is shown for chain seg- 
ments of length = 100. One can immediately see that 
the system with p = 0.1 shows a range of values, where 
excluded volume is not screened, extending much beyond 
the length scale of a single bond. On the other hand for 
p = 0.5 excluded volume interactions are screened very 
fast resulting in ideal chain behavior over a wider range 
of q-valucs. We can determine the screening length by 
performing a linear fit with slope —2 for small values 
(but beyond the scale where asymptotic behavior sets in) 
and a linear fit with slope --l/v for large g- values (but 
away from the length scale of a single bond). We then 
extract the value where the crossover between the two 
regimes occurs and obtain the following screening lengths 



/9 = 0.1 : w 0.6 

p = 0.5: g^wl.S 
p= 1.0 : 



9? 



; 10.5 
;3.5 
: 1 



For p — 1 the system seems to be that dense that ex- 
cluded volume is shielded on the order of a bond length, 
therefore the scaling regime where S{q) ~ q~^^'^ does not 
show off any more. This result does not come as a big 
surprise as the screening length ^ is related to the average 
mesh size in the system 2l| . This mesh size is - as every 
lattice site is occupied - approximately equal to unity. 



The gyration tensor 
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The shape of a a polymer is described by its gyration 
tensor. The gyration tensor is defined as 

1 ^ 

5„„ = -^r«r« (13) 

Here r^'^ is the coordinate vector of the ith monomer 
and the subindex denotes its cartesian components. The 



FIG. 10: Structure function of chain segments of length 
Nm ~ 100 for compact polymers with different densities. For 
the calculation the largest simulated system sizes for each den- 
sity was used. The value of the segment length was chosen 
such that the radius of gyration of the segments is way be- 
low the system size. Shown is also the scaling regimes where 
S{q) ~ and S{q) ~ q^^^^ . From a fit to the curves one 
can determine the crossover value q^ which determines the 
screening length. 
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FIG. 11: The ratio between the largest and the smallest eigen- 
value of the gyration tensor. Eigenvalues are the squared 
lengths of the gyration ellipsoids main axes. While for RW 
or SAW polymers the tensors show a characteristic aspheric- 
ity, compact polymers have an almost spherical shape in the 
asymptotic limit. 



eigenvalues Ai < A2 < A3 give the squared lengths of 
the principal axes of gyration. The ratios of the eigen- 
values indicate the deviation from a sphere-like shape of 
the polymer. It is well-known, for example, that the gy- 
ration tensor for self-avoiding walks and random walks 
in good solvent has a pronounced asphericity. This as- 
phericity shows up in the asymptotic ratios of the eigen- 
values, namely |23l.l24| 



(A3> 
(A3> 



(A2> 
(A2> 



(Ai> 
(Ai> 



12 : 2.7 : 1 
14 : 2.98 : 1 



for a RW 
for a SAW 



For Hamiltonian paths it is clear that there can be no 
asphericity in a symmetric simulation box as every lattice 
site is occupied. However it is not clear a priori that this 
holds true for less dense systems, where a crossover to the 
self-avoiding walk behavior might occur. Fig. [Tl] shows 
the ratio (A3) : (Ai). For p = 0.1 a pronounced deviation 
from the symmetry shows up for small N, but evidently, 
this deviation vanishes for large system sizes. 



APPLICATION OF THE RESULTS TO 
EXPERIMENTAL DATA 

In this section wc apply the findings from the previ- 
ous sections to a current research field of biophysics: the 
folding of chromatin. Chromatin is a compacted state 
of DNA, which can be found inside the nucleus of eu- 
karyotic cells |25|. This compacted state arises when 
the DNA double strand is wrapped around histone cores 
forming the so-called nucleosomes which in turn arrange 
in a beads-on-a-string-like manner forming the chromatin 
fiber. Amazingly little is known about the higher-order 



organization of chromatin inside the interphase nucleus. 
Indeed, about 2 meters of double-stranded DNA have to 
be densly packed into a nucleus of only 10 fim in di- 
ameter while still parts of the DNA have to be accessi- 
ble for large proteins. The detailed folding mechanisms 
are still a mystery. One reason for the slow advance in 
the field of chromatin folding is that imaging techniques 
do not allow one to follow the chromatin fiber along its 
contour inside the living cell. High-resolution techniques 
like EM are too invasive to maintain the structure one 
wants to observe. Therefore, one has to rely on indirect 
approaches. One such approach is fluorescence in situ 
hybridization (FISH). Here two fluorescent markers are 
positioned along the contour of the chromatin fiber hav- 
ing a certain genomic distance g between them. Confocal 
light microscopy then allows one to determine the spatial 
distance between these two markers. 

In an earlier study \vi\ we presented a model for chro- 
matin organization, the random loop (RL) model. This 
model assumes that random chromatin-chromatin inter- 
actions play a dominant role for the structural organiza- 
tion of chromatin in the interphase nucleus. The random 
loop model predicts a levcling-off in the mean square dis- 
placement between two FISH markers in relation to the 
genomic distance. This has been nicely confirmed by ex- 
periments 0. Importantly, the RL model explains the 
folding of chromatin without assuming a confined geom- 
etry. This is consistent with experimental data showing 
that the leveling-off takes place well below the diameter 
of the cell nucleus. 

Here wc want to reanalyze the experimental data 
keeping an eye on the type of compact polymers stud- 
ied above. Obviously, compact polymers also show a 
leveling-off in the mean square displacement. With re- 
spect to Hamiltonian paths this is necessary, as there is 
the confined space of the cubic box. However this is a 
general feature of compact polymers. Therefore the ques- 
tion may be asked whether there is a difference between 
the random loop polymers and generic compact poly- 
mers. Indeed, compact polymers arise whenever attrac- 
tive interactions begin to play a dominant role in com- 
parison to entropic forces. The random loop model also 
assuming attractive intra-chromatin interactions. The 
important difference is that the random loop ensemble 
contains an average over a disorder given by the ran- 
domness in the loop attachment points. 

Fig. [12] shows the moment ratios which have been stud- 
ied in Figs. [4] and [5] together with the experimental data. 
The errorbars represent the propagated standard error of 
the measurements. They are quite large as we look on 
higher-order moments and the number of measurements 
per genomic distance is in the order of 50-100. Shown 
are also results from the RL model for a chain with- 
out excluded volume of length N = 1000 and a loop- 
ing probability of p = 7 x 10~^. Strikingly, in all three 
cases the moment ratios for the experimental data are 
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larger than for a random walk, self-avoiding walk. This 
holds especially true for compact polymers where the mo- 
ment ratios are even smaller. Note that for simplicity 
Fig. [T21 shows the ratios for the end-to-end distances, as 
we have shown in Fig. [5l the ratios for intrachain dis- 
tances are even smaller. Obviously chromatin organiza- 
tion is not just a compacted state in terms of polymer 
physics, i.e. a chain in a poor solvent. The data suggests 
that there are structural features causing the fluctuations 
to be larger than even for the random walk model. The 
RL model offers an explanation for these experimental 
findings: Actually, chromatin-chromatin interactions are 
quite dynamic and there is a cell-to-cell variation not only 
in the thermal ensemble but also in the ensemble of dif- 
ferent loop configuration. This disorder average gives rise 
to moment ratios extending well beyond the random walk 
limit (see Fig. [T2|) . However, the RL model still shows 
significant deviations from the experimental data. This 
is not surprising as the RL model as presented here as- 
sumes a homogeneous structure along the polymer while 
it is well-established that this is not the case [1, . Nev- 
ertheless, this analysis clearly shows that the RL model 
extends beyond standard polymer models with attractive 
interactions and is in better agreement with experimen- 
tal data on chromatin folding without assuming any a 
priori confinement of the chromosomes as has been done 
in other studies 27 1. 



CONCLUSIONS 

In this study we have investigated the conformational 
and statistical properties of an ensemble of compact poly- 
mers. 

The algorithm we use here is basically the one pro- 
posed by Mansfield [l3| for unbiased sampling Hamil- 
tonian paths on a cubic L x L x L lattice. As we are 
interested not only in Hamiltonian paths, but a broader 
class of compact polymers we modified the algorithm by 
allowing vacancies on the cubic lattice such that we are 
able to sample compact polymers in the regime where 
T 7^ — or in terms of the lattice occupancy fraction 
/5 ^ 1. Using a highly parallel computing system we sam- 
pled conformations up to chain lengths of = 256 000 
and three different densities p ~ 0.1, p = 0.5 and p = I. 

While a lot of studies are devoted to the properties of 
compact polymers, little attention has been paid to the 
end-to-end distance distribution as well as to the distri- 
bution of intrachain segments within a globular polymer. 
However, this information is crucial for comparison to ex- 
perimental data • Wepropose that the scaling function 
introduced by Fisher [20| for the end-to-end distance dis- 
tribution also holds approximately for compact polymers. 
We determine the exponents S and /x (see eq. ([3])) by a 
fit to the simulation data. One of the findings is that 
the scaling exponents do not depend on the density of 



the system studied. While the end-to-end distributions 
contain a very detailed information on the underlying 
polymer model, it is often sufficient to look at the first 
moments of this distribution. We analyze three dimen- 
sionless ratios of the end-to-end distribution function and 
show that deviations from a self-avoiding walk behavior 
become apparent from the fourth moment on when look- 
ing at the ratio {R'^) / (i?^). 

From an experimental point of view it is often more in- 
teresting to know the distance distribution between two 
monomers along the contour of the chain. Therefore we 
analyze the distance distribution of the end points of seg- 
ments of length Nrn inside a compact polymer of length 
A^. We find that on very short contour lengths Nm there 
is a random walk behavior (screening effect) while for 
larger contour lengths the moment ratios pretty fast fall 
below the ratios of the end-to-end distances. 

We also investigate the screening of excluded volume in 
compact polymers, which was expected from mean-field 
theory to exist not only in polymer melts, but also in 
compact polymers [H. We find that there is a dominant 
crossover from the scaling (i?^) ~ N-^'^ to (i?^) ^ A^ with 
increasing density (Fig. [9]). To estimate the screening 
length we carefully evaluate the structure function and 
find (Fig. [T0|) that the behavior is indeed similar to a 
polymer melt. 

The aim of this study is not purely to enlighten us con- 
cerning the theoretical knowledge of compact polymers, 
but also to apply the findings to experimental data. One 
prime example where the results on distance distribu- 
tions can be directly compared to is the organization of 
the chromatin fiber inside the human interphase nucleus. 
With fluorescent labeling experiments one can determine 
the distance between two markers separated by a certain 
genomic distance g, which is related to contour length 
Nm in the simulations. We find that the chromatin fiber, 
although in a compact state that pretty much resembles 
a globular polymer behavior, shows significant deviations 
from these type of polymers. We propose that this is due 
to the dynamics of loop formation and unfolding, which 
is described well by the random loop model 0, [l3l • 

Monte Carlo simulations have been performed on the 
HELICS2 cluster at the Center for Scientific Comput- 
ing (IWR) in Heidelberg. M.B. gratefully acknowledges 
funding from the Landesgraduiertenforderung Baden- 
Wiirttemberg and support from the Heidelberg Graduate 
School of Mathematical and Computational Methods for 
the Sciences. 
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FIG. 12: Experimental data from FISH measurements 0]- 
Shown are the moment ratios of the measurements together 
with the predicted values for the random walk (RW) , the self- 
avoiding walk (SAW) and the globular state (GS). Obviously 
the data has significantly higher moment ratios than these 
classical polymer models. The RL model comes closer to ra- 
tios of the experimental data by virtue of disorder averaging. 
However it does not explain the detailed structure of the chro- 
matin regions studied. 



